IAES International Journal of Artificial Intelligence (IJ-AI) 


Vol. 9, No. 2, June 2020, pp. 304~309 


ISSN: 2252-8938, DOI: 10.1159 1/ijai.v9.i2.pp304-309 0 304 


Nutrient deficiency detection in maize (Zea mays L.) leaves 


using image processing 


Nurbaity Sabri, Nurul Shafekah Kassim, Shafaf Ibrahim, Rosniza Roslan, Nur Nabilah Abu 


Mangshor, Zaidah Ibrahim 


Faculty of Computer and Mathematical Sciences, Universiti Teknologi MARA (UiITM) Melaka, Kampus Jasin, 77300 


Merlimau, Melaka, Malaysia 


Article Info 


ABSTRACT 


Article history: 


Received Jan 28, 2020 
Revised Apr 8, 2020 
Accepted Apr 25, 2020 


Keywords: 


Color histogram 
Gray-Level Co-Occurrence 
Hu-histogram 

Maize leaves 

Nutrient deficiency 


Maize is one of the world's leading food supplies. Therefore, the crop's 
production must continue to reproduce to fulfill the market demand. Maize is 
an active feeder, therefore, it need to be adequately supplied with nutrients. 
The healthy plants will be in deep green color to indicate it consist of 
adequate nutrient. Current practice to identify the nutrient deficiency on 
maize leaf is throught a laboratory test. It is time consuming and required 
agriculture knowledge. Therefore, an image processing approach has been 
done to improve the laboratory test and eliminate a human error in 
identification process. The purpose of this research is to help agriculturist, 
farmers and researchers to identify the type of maize nutrient deficiency to 
determine an action to be taken. This research using image processing 
techniques to determine the type of nutrient deficiency that occurs on the 
plant leaf. A combination of Gray-Level Co-Occurrence Matrix (GLCM), 
hu-histogram and color histogram has been used as a parameter for further 


classification process. Random forest technique was used as classifiers 
manage to achive 78.35% of accuracy. It shows random forest is a suitable 
classifier for nutrient deficiency detection in maize leaves. More machine 
learning algorithm will be tested to increase current accuracy. 
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1. INTRODUCTION 

Zea mays L. the scientific name of maize is a very important staple food supply in many parts of the 
world from maize itself to cereal. Maize can adapt in different kind of environment and growing in a wider 
area than other major crop such as potato, wheat and soybean [1]. The maize plant can be defined as a 
metabolic system whose at the end of the product will produce variety starch deposited from the maize 
kernels such as cereals or flour. This plant is the most prominent cereal grain all over the world, after rice and 
wheat. It is commonly providing full-bodied nutrients for humans and animals. 

Maize plants are precisely intensive feeders and even soils need to be very fertile to fully supplied 
nutrients as the plant develop. Compared to other single-cell systems, maize has a long-life cycle since it easy 
to grow [1]. As to indications of nutrient adequacy, plants ought to be in deep green colour. Some of maize 
plant facing nutrient deficiency as a result of several agronomic and environment factors. Traditional nutrient 
deficiency diagnostic methods require comprehensive soil or plant tissue laboratory testing or manual 
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examination by farmers [2]. This is a tedious and laborious work, though, and can be achieved only with very 
small samples [3]. Therefore, most of the methods of production are approximate. 

Nutrients deficiency detection important to ensure all maize grown in accordance with what is 
requires such as Nitrogen, Iron and more. This nutrient deficiency in maize visually can be seen through the 
leaves of affected plants using image processing. Moreover, image processing technique can assure that the 
methodology may be reliable and able to be detection tools in agricultural field [4]. The use of image 
processing is more accurate since it can capture the differences of pattern, colour, and the surface that 
affected. Understanding these signs will help determine corrective action to normalize the plant [2]. 
Classification process in image processing consists of features and classifier. It is essential to concentrate on 
the feature extraction stage as it has an observably affects on the competence of the recognition system [5]. 
There are different technique for feature extraction such as texture, color and shape [6]. 

Gabor filter has been used by researcher for various texture analysis applications [7]. Gabor filter is 
intended to test the whole recurrence domain of an image by portraying the center frequency and orientation 
parameters [8]. By deploying of the Gabor filter, image steganalysis improved scheming tool in consequence 
use of the steganalysis [9]. However, Gabor filters have poor execution when the image is fragmented 
numerous smaller texture, in this manner influencing the precision of image segmentation [10]. GLCM one 
of the well know texture features consist of co-occurrence matrices results are better than other texture 
distinct methods [11]. It is a conventional method of texture feature extraction that can be useful for image 
classification, segmentation, recognition and more [11]. Color moment (CM) consider as an effective and 
simple method for color feature [8]. There are overall three step in color moment which are mean, Variance 
and skewness [12]. According to previous article also mention that CM has a compact feature as it is only 
required three color components [12]. However, CM is low in discrimination power according to previous 
article. Color histogram very effective and give actual presentation visualization [12]. In addition, color 
histogram is useful to recognize image. The implemented color histogram approach has proven to be very 
easy and efficient to enforce [13]. Histogram Orientation Gradient (HOG) detects edge or gradient form that 
is very descriptive of local shape and does so in a local representation with only an easily managed degree of 
in-variance of local geometry [14]. However, it takes a long computation time [15]. A fast and very simple 
shape features which is hu moment has been introduce [16]. It is the best method in imge processing 
strategies [17]. Therefore, this research implement combination of GLCM, color histogram and hu moment 
to as a features to be used in classifier. 

Support Vector Machine has been used to detect a leaf disease on grape leaf [18]. However, it is 
difficult to specify the best parameter to use if data is not separate linearly [19]. CNN is one of excellent 
image processing approach in Artificial Intelligent that implement general and detail task. Many CNN 
architectures has been used for image classification and recognition such as AlexNet and LeNet [20]. 
Enhancements in convolutional neural networks (CNNs) recently have made them the best in class among 
machine learning approaches for addressing computer vision issues, especially in image classification [21]. 
This architecture has also been implements on Maize leaf to identify the disease on maize plants and a high 
accuracy achive [22]. However, one main disadvantage of CNN based methods is that they usually need large 
datasets to train a feasible model [23]. Random Forests (RF) is among the most effective and efficient 
machine learning approaches in today's algorithms [24]. Because of their high predictive precision, random 
forests since then have proven to be successful in so many fields [25]. Due to a great success of random 
forest, implementation of this algorithm will be done to classify nutrient deficiency of maize leaf. 


2. RESEARCH METHOD 

Image processing consist of step by step process on classify the nutrient into three classes name 
Nitrogen, Potassium and Magnesium deficiency. Below shows the flow of classication starting from input 
image, preprocessing, feature extraction, and classification. Figure 1 illustrates the leaf detection and 
classification diagram for nutrient classification on maize leaf. 


Pre- Feature 


Input Image Classification Output 


processing Extraction 


Figure 1. Leaf detection and classification 
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2.1. Input image 

In this study, three type of nutrient deficiency which are Nitrogen, Magnesium and Potassium are 
being collected. Up to 30 distinct images of classes of nutrient deficiency mentioned will be processes to 
detect type of nutrient deficiency of the maize leaf. The dataset of maize leaf will be divided into two 
partition which are training and testing. Figure 2 shows example of image of nutrient deficiency. 


(b) (c) 


Figure 2. Image of (a) Nitrogen, (b) Potassium and (c) Magnesium deficiency [26] 


2.2. Preprocessing 

An input image contains of unwanted noise, preprocessing phase is to remove noise and 
enhancement of image. Image Preprocessing is to expel the undesirable noise in image pursued by section 
and smoothing of the image and completed to improve the quality of the image [27]. An image of maize leaf 
captured then resize to 500 X 500 pixels to reduce processing time. The images needed to be enhances before 
go through to the next process. Images are filter using median filter. Median filter is a non-linear process to 
remove unwanted noise and outliers from the input images. 


f(xy) = median {g(s,t)} (1) 


The formula above is the output of median filter where, f (x, y) is the original image and g (s, t) is the output 
image. S is a two-dimensional mask, where the mask size is m X m (where m is usually odd) such as 3 x 3, 5 
x 5, etc. 


2.3. Feature extraction 

Once image preprocessing is finished, it is important to get the most important qualities of the leaves 
for separating them with respect of each deficiency [28]. Process of extracting related information from input 
image is called feature extraction. It is also to transforming input image into a set of features. In the proposed 
methodology, a comprehensive experimentation is carried out considering texture features from the maize 
leaf. 


2.4. Texture feature 

Gray Level Co-occurrence Matrix (GLCM) approach defines the shape of distribution of various 
tones intensities in the image, which are determined by acquiring the cooccurrence matrices of the image. 
GLC®M is created from a gray-scale image. A co-occurrence matrix portrays the frequency at which a specific 
gray level is shown in a particular spatial relationship, in relation to another gray level in an image. In this 
way, the co-occurrence matrix is an outline on how the pixels values are exhibited alongside to another value 
in a small window. Because of their extensive dimensionality, the GLCM's are sensitive to the size of the 
texture samples on which they are assessed. Consequently, the quantity of dark dimensions is frequently 
diminished. The two-dimensional array denoted by P of both rows and columns is signifying to a set of 
possible gray levels in image values. P (i, j | dx, dy) is the relative frequency with which two pixels, separated 
by a distance d (dx, dy). 


2.4.1. Contrast 

It also called as inertia or sum of square variance. The function is to calculate the intensity of the 
contrast. The contrast statistical feature used to measure the local variations in the GLCM and restoring a 
calculation contrast of the intensity between the pixels and their neighbors over the whole image. N is an 
unknown value. Contrast is 0 if the image is constant (dimension of input image size). Equation (2) shows the 
equation for contrast. 
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Yijz0 Pi; (i — f)? (2) 


2.4.2. Correlation 

Correlation texture is a return measure of linear dependency gray level between the predetermined 
pixels at the specified positions relative to those neighbors over the entire image. Moreover, it is provided 
same measurement to autocorrelation method. A correlation is calculated using this formula: 


N-1 p |G eDG- Hi 
teu [ethee 7 


N is an unknown value, is mean based on the reference to the pixels and o is a standard deviation. 
Correlation range -1 is for negative correlative image and | is for positive correlative image. On the other 
hand, for a constant image (dimension of input image size), the correlation is NaN. 


2.4.3. Homogeneity 

The measures of the smoothness homogeneity distribution of the gray level of the image it is 
approximately inversely correlated with contrast. If contrast is small, for the most part homogeneity is 
substantial. If weights decline away from the diagonal, the calculated texture measure will be bigger for 
windows with little difference [29]. Return homogeneity weights values by the inverse of the Contrast 
weight, with weights diminishing exponentially far from GLCM diagonal. N is an unknown value. Range of 
homogeneity is | if it is a diagonal GLCM. Homogeneity feature formula as: 


n-1 _ Pij 
iJ=0 55 Gp? (4) 
2.4.4. Entropy 

Entropy is a statistical measure of randomness that can be utilized to describe the texture of the 
maize leaf image and generally classified as a first-degree measure. The images with a larger number of 
distributed gray levels have bigger entropy. N is an unknown value and In is this logarithm and uses a base 
close to 2.718 same with log with a base of 10. Entropy feature as following formula: 


Nz0P,j(—InP,;) (5) 


2.5. Color feature 

A histogram counts the pixels number in each form and can be easily generated by reading each 
pixel of each image once and increasing the histogram's correct bin [13].The histogram features represent 
statistical-based features, whereby the histogram is often used as a representation of the distribution of 
likelihood of the image intensity levels [30]. The color histogram for an image is created by mapping input 
values in a smaller set of colors within the image from a large set of output values and counting the number 
of pixels of each color [31]. Scanning the image, setting color values to the histogram scale, and constructing 
the histogram using color attributes as indicators are easy processes for creating color histogram 
characteristics. For this study, to compute the color histogram is using the following parameters such as 
images, channels, mask, histogram size and ranges. 


2.6. Shape feature 

Shape is the primary source of information used to recognize objects. No visual content object can 
be properly recognized without shape. Moment invariants are essentially the region descriptors that are most 
popular and widely [32]. 


2.7. Random forest 

The random forest classifier is close to the top of the classifier rankings. Random forest could be 
used for classification as well as for regression. A random forest's cumulative prediction error is tightly 
correlated with individual trees ' intensity and density in the forest. Adding significant randomness in the 
base models, trees and creating subsets of the predictor system can refines bagging to separate the a tree 
nodes of random forest [33]. In the training phase, it automatically calculates the appropriate score for each 
element. After that it scales down the significance for the total of all scores to be 1. The overall explanation 
can be defined by using the Gini index. 


Gini = 1 — Yi-s@ii)? (6) 
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In addition, above equation includes the class and likelihood to decide which branch Gini is most likely to 
occur on a node. Thus, pi is the absolute frequency of the class that will be finds in the dataset, and c is the 
number of classes. The function of classification process is to classify image according to the type of nutrient 
deficiency and its accuracy percentage. In this research 


3. RESULTS AND DISCUSSION 

The testing result of the nutrient deficiency of maize leaf is divided into three types which are 
Magnesium, Nitrogen, Potassium and healthy shows in Table 1. The classifier shows the maximum 
probability for the class it predicts. Confusion matrix is used to calculate accuracy percentage for the overall 
system. The result of potassium detection effected by its similarity with magnesium features. 
Thus, potassium has the lowest number of correct detection as it mostly detects it as a magnesium as it has 
the closer features as magnesium. The accuracy achieve 78.35 percent as consequence of low detection on 
potassium. 


Table 1. Result of the identification 


Type Number of Tested Image | Number of Image Identified Correctly | Number of Image Identified Incorrectly 
Healthy 31 31 0 
Magnesium BP) 35 0 
Nitrogen 36 26 10 
Potassium 32 11 21 


Accuracy % = pe EE Ee ee 00 (7) 


Total images testing 


4. CONCLUSION 

This research classifies four type of class which is healty leaf, nutrient, magnesiuam and potassium. 
Some of the nutrient deficiency is likely to have the same trait, therefore it is difficult to classify these 
nutrient. More data will be collected in the future by adding more training data for classification process. 
Besides, the same dataset will be tested with other available machine learning to increase the current 
accuracy achieve by random forest classifier. 
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